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Abstract 

A well-studied nonlinear extension of the minimum-cost flow problem is to minimize the 
objective J2ij^E ^vifij) '^^^^ feasible flows /, where on every arc ij of the network, Cij is a 
convex function. We give a strongly polynomial algorithm for finding an exact optimal solution 
for a broad class of such problems. The key characteristic of this class is that an optimal solution 
can be computed exactly provided its support. 

This includes separable convex quadratic objectives and also certain market equilibria prob- 
lems: Fisher's market with linear and with spending constraint utilities. We thereby give the 
first strongly polynomial algorithms for separable quadratic minimum-cost flows and for Fisher's 
market with spending constraint utilities, settling open questions posed e.g. in |15^ and in |35j . 
respectively. The running time is 0(m^ logm) for quadratic costs, 0(n^ + n'^ (m + nlogn) logn) 
for Fisher's markets with linear utilities and 0(mrv' -\- m? (m + n log n) log m) for spending con- 
straint utilities. 

1 Introduction 

The flow polyhedron is defined on a directed network G = {V, E) by arc capacity and node demand 
constraints. For feasible solutions /, we aim to minimize Ylij&E ^ijifij)^ where on each arc ij € E, 
Cij is a diff^erentiable convex function. Assume that each dj can be described by some numerical 
parameters (for example, if Cij{a) = CijO^ + dija, then it is described by the values Cij,dij). By 
a strongly polynomial algorithm for such a problem, we mean the following: (i) the algorithm 
uses only elementary arithmetic operations and comparisons; (ii) the number of these operations is 
bounded as a polynomial of the number of nodes (n) and arcs of the network (m); (in) if the input 
consists of rational numbers, then the size of all numbers during the computations is polynomially 
bounded in n, m, and the size of the input numbers. 

Flows with separable convex objectives are natural convex extensions of minimum-cost fiows 
with several applications as matrix balancing or traffic networks, see [H Chapter 14] for further refer- 
ences. Polynomial-time combinatorial algorithms were given by Minoux [25] in 1986, by Hochbaum 
and Shantikumar [17J in 1990, and by Karzanov and McCormick [21] in 1997. The latter two 
approaches are able to solve even more general problems of minimizing a separable (not necessar- 
ily differentiable) convex objective over a polytope given by a matrix with a bound on its largest 
sub determinant. Both approaches give polynomial, yet not strongly polynomial algorithms. 

In contrast, for the same problems with linear objectives, Tardos [33. ,34j gave strongly polyno- 
mial algorithms. One might wonder whether this could also be extended to the convex setting. This 

'College of Computing, Georgia Institute of Technology. Supported by NSF Grant CCF-0914732. E-mail: 
veghal@cs.elte.hu . 



1 



seems impossible for arbitrary convex objectives by the very nature of the problem: the optimal 
solution might be irrational, and thus the exact optimum cannot be achieved. 

Beyond irrationality, the result of Hochbaum [15j shows that it is impossible to find an e- 
approximate solution in strongly polynomial time even for a network consisting of parallel arcs 
between a source and a sink node and the Cjj's being polynomials of degree at least three. This is 
based on Renegar's |30j result showing the impossibility of finding e-approximate roots of polyno- 
mials in strongly polynomial time. 

The remaining class of polynomial objectives with hope of strongly polynomial algorithms 
is convex quadratic. For these functions, the existence of a rational optimal solution is always 
guaranteed. Granot and Skorin-Kapov [12] extended Tardos's method jS^ to solving separable 
convex quadratic optimization problems where the running time depends only on the entries of 
the constraint matrix and the coefficients of the quadratic terms in the objective. However, in a 
strongly polynomial algorithm, the running time should only depend on the matrix. 

The existence of a strongly polynomial algorithm for the quadratic flow problem thus remained 
an important open question (mentioned e.g. in [l5l HI [HI [I2l [32j). The survey paper [16] gives 
an overview of special cases solvable in strongly polynomial time. This includes fixed number of 
suppliers (Cosares and Hochbaum, [4J), and series-parallel graphs (Tamir [32]). We resolve this 
question affirmatively, providing a strongly polynomial algorithm for the general problem in time 
0{m^ logm). 

There is an analogous situation for convex closure sets: [15] shows that no strongly polynomial 
algorithm may exist in general, but for quadratic cost functions, Hochbaum and Queyranne gave a 
strongly polynomial algorithm |14j . 

An entirely different motivation of our study comes from the field of market equilibrium al- 
gorithms. Devanur et al. [5] developed a polynomial time combinatorial algorithm for a classical 
problem in economics. Fisher's market with linear utilities. This motivated a line of research to 
develop combinatorial algorithms for other market equilibrium problems. For a survey, see |27[ 
Chapter 5] or [36]. All these problems are described by rational convex programs. For the linear 
Fisher market problem, a strongly polynomial algorithm was given by Orlin [29]. 

To the extent of the author's knowledge, this field has been considered so far as an entirely 
new domain in combinatorial optimization. An explicit connection to classical flow problems was 
pointed out in the recent paper [37]. It turns out that the linear Fisher market, along with several 
other problems, is captured by a concave extension of the classical generalized flow problem, solvable 
by a polynomial time combinatorial algorithm. 

The paper [37] uses the convex programming formulation of linear Fisher markets by Eisenberg 
and Gale [7]. An alternative convex program for the same problem was given by Shmyrev |31j. 
This formulation turns out to be a convex separable minimum-cost flow problem. Consequently, 
equilibrium for linear Fisher market can be computed by the general algorithms [17] or |21j (with 
a final transformation of a close enough approximate solution to the exact optimal one). 

The class of convex flow problems solved in this paper also contains the formulation of Shmyrev, 
yielding an alternative strongly polynomial algorithm for linear Fisher market. This formulation 
can be extended (see Devanur et al. [2]) to spending constraint utilities, a market defined by 
Vazirani p5]. For this problem, we obtain the first strongly polynomial algorithm. Our running 
time bounds are 0(n^ + n?{m + n log n) log n) for linear and 0{mrfi + m?{m + n log n) log m) for 
spending constraint utilities, with m being the number of segments in the latter problem. Orlin 
[29| used the assumption m = 0{n'^) and achieved running time O(n^logn), the same as ours 
under this assumption. So far, no extensions of p9] are known for other market settings. Using 
the general framework enables a simpler treatment as in [29j . 
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1.1 Prior work 



For linear minimum-cost flows, the first polynomial time algorithm was the scaling method of 
Edmonds and Karp [6]. The current most efficient strongly polynomial algorithm, given by Orlin 
[28], is also based on this framework. On the other hand, the algorithm of [6] has also been extended 
to the convex minimum-cost flow problem, first by Minoux [24J to convex quadratic flows, later to 
general convex objectives (Minoux [25], Hochbaum and Shantikumar [17J). Our algorithm is an 
enhanced version of these later algorithms, in the spirit of Orlin's technique [28j. However, there 
are important differences that make the nonlinear setting significantly harder. Let us remark that 
Orlin's strongly polynomial algorithm for linear Fisher market [29] is also based on the ideas of 
[28] . To get an overview on linear and nonlinear minimum-cost flow algorithms, we refer the reader 

tod]. 

The algorithm of Edmonds and Karp consists of A-phases, and the scaling factor A decreases 
at least by a factor of two for the next phase. The elementary step of the A-phase transports A 
units of flow from a node with excess at least A to another with demand at least A. This is done 
on a shortest path in the A-residual network, the graph of residual arcs with capacity at least A. 
An invariant property maintained in the A-phase is that the A-residual network does not contain 
any negative cost cycles. When moving to the next phase, the flow on the arcs has to be slightly 
modified to restore the invariant property. 

Orlin's algorithm [28J works on a problem instance with no upper capacities on the arcs (every 
minimum-cost flow problem can be easily transformed to this form). The basic idea is that once an 
arc ij has flow at least 4nA in the A-phase, then the flow on ij must be positive in some optimal 
solution. Such an arc is called abundant. Using primal-dual slackness, this means that ij must be 
tight for an arbitrary dual optimal solution. It can be shown that within O(logn) scaling phases, 
an abundant arc ij appears. 

Based on this observation, one can obtain the following simple algorithm. Let us contract ij, 
and restart the Edmonds-Karp algorithm in the smaller graph. Iterating this method, we can find 
an optimal dual solution, which easily enables to compute an optimal primal solution. Instead of 
explicitly contracting and restarting, [28] (see also [T| Chapter 10.7]) continues the scaling method 
after finding an abundant arc, and maintains the forest F of such arcs. A new arc enters F in 
O(logn) phases, and the running time can be bounded by 0(m log n(m + n log n)), so far the most 
efficient minimum-cost flow algorithm known. 

Let us now turn to the nonlinear setting. The Edmonds-Karp algorithm naturally extends here 
(see [241 [25] [T7]). In the A-phase, the invariant is that the A-residual graph contains no negative 
cycle for the linearization of the cost function into A-chunks. That is, the cost of the arc ij is 
defined as (Cjj(/ij + A) — Cij{fij))/ A. When transporting A-units of flow on a shortest path with 
respect to this cost function, the invariant is maintained. A key observation is that when moving 
to the A/2-phase, the invariant can be restored by changing the flow on each arc by at most A/2. 
As A approaches 0, the cost of ij converges to the derivative C[j{fij). By the KKT conditions, a 
flow / is optimal if and only if the residual graph contains no negative cycles for the derivatives. 
A variant of this algorithm is outlined in Section [3] 

1.2 Our results 

The class of problems where we give a strongly polynomial algorithm is defined by four assumptions. 
We need to have an oracle access to the derivatives of the Cjj's by Assumption [1] The technical 
Assumption [2] restricts the problem instances where each cost functions is either strictly convex or 
linear. 
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Two main ingredients of the algorithm are black box subroutines provided by Assumptions [3] 
and m The first subroutine returns a solution that satisfies optimally the KKT-conditions on a 
subset of arcs. In particular, it returns an optimal solution if the input is the support of an optimal 
solution. For quadratic cost functions and for the Fisher markets, this can be implemented by 
solving simple systems of equations. The second is a technical subroutine measuring the quality of 
a candidate solution, returning its distance from optimality in a certain metric. This subroutine 
appears to be the bottleneck in complexity: for quadratic costs, it needs a minimum cost-to-time 
ratio cycle computation (also known as the tramp steamer problem). For the Fisher markets, an 
all pairs shortest paths computation is performed. The main result of the paper is the following. 

Theorem 1.1. Let Assumptions 1-4 hold for the problem of minimizing "^-^^ Cij{fij) over feasible 
flows f in a network on n nodes and m arcs, among them having nonlinear cost functions. Let 
PT{n,rn) and pE{n,rn) denote the running time of the subroutines defined in Assumptions\^and\^ 
and let ps{n,m) be the running time needed for a single shortest path computation. Then an exact 
optimal solution can be found in 0{{n + mM){pT{n,m) + pE{n,m)) + (n + tun)'^ ps{n,m) log m) 
time. 

This gives an 0(m^ log m) algorithm for quadratic convex objectives. For Fisher markets, we 
obtain 0(n^ + n^ijn + n log n) log n) running time for linear and 0{mn^ + m?(m + n log n) log m) 
for spending constraint utilities. 

We shall now outline the basic ideas of our strongly polynomial algorithm. The problem is 
first transformed to an instance with no upper bounds on the arcs. All cost functions Cij are 
assumed to be differentiable. From a complexity perspective, cost functions are provided via oracles 
(Assumption d]) , as in [17] and [2T] . 

For simplicity of presentation, let us now assume that there exists a unique optimal solution 
/*. We shall assume that /* can be computed exactly (in particular, it is rational). Let F* denote 
the set of arcs ij with f*j > 0. By the KKT conditions, ttj — -Ki = C[j{f*j) for each ij G F* and for 
the Lagrangian multipliers vr. Assumption [3] guarantees that if we can somehow guess the set F* , 
then we can obtain the optimal /* by solving the above equality system. 

Our starting point is the nonlinear scaling scheme as described above, with the only difference 
that the linearized cost function is replaced by C^j(/jj + A). This has similar properties but is 
easier to handle. As in [28j, we can identify an abundant arc ij in O(logn) steps, which must 
be contained in F* . However, contraction does not work. The reason is that the KKT-conditions 
give TTj — TTj = Clj{f*j), a condition containing both primal and dual (more precisely, Lagrangian) 
variables simultaneously. 

In each phase of the algorithm, we shall maintain a set F C F* of arcs, called revealed arcs. F 
will be extended by a new arc in every O(logn) phases; thus we find F* in 0(m log n) steps. We 
change the terminology from abundant as they will not necessarily carry a huge amount of flow. 
Indeed, for revealed arcs, we delete the lower capacity and even allow the flow value to become 
negative. Accordingly, we keep both ij and ji for ij G in the set of residual arcs. 

The second important difference is that in contrast to Orlin's algorithm, we cannot assume 
that F is acyclic: there are simple quadratic instances with the support of an optimal solution 
containing cycles. In Orlin's algorithm, progress is always made by connecting two components of 
F. This will also be an important event in our algorithm, but sometimes F must also be extended 
with arcs inside a component. 

At the end of each phase when F is extended, we compute the discrepancy of F, the maximum 
absolute value of the sum of node demands in a component of F. If this value is large, then it can 
be shown that F will be extended within O(logn) phases as in Orlin's algorithm. 
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If the discrepancy is small, the procedure Trial-and-Error is performed. We pretend that 
F = F* and try to compute an optimal solution / under this hypothesis, which can be done by a 
subroutine guaranteed by Assumption [3l If F C i*"*, then / is not optimal. Another subroutine, 
described in Assumption HI enables us to compute the smallest A value for which / is A-feasible 
(that is, it satisfies the necessary conditions for a A-phase). If A < A/2, then we set A as our 
next scaling value and / = / as the next pseudoflow. Otherwise, the usual transition to phase 
A/2 is done with keeping the same flow /. The analysis shows that a new arc shall be revealed in 
every O(logre) phases. The key lemma is a proximity result between / and /, which implies that 
Trial-and-Error cannot return the same / if performed again after O(logn) phases, implying 
that the set F cannot be the same. 

The paper is organized as follows. Section [2] contains the basic definitions and notation. Sec- 
tion [3] presents the simple adaptation of the Edmonds-Karp algorithm for convex cost functions, 
following Minoux [25] . Our algorithm in Section H] is built on this algorithm with the addition of 
the subroutine Trial-and-Error, that guarantees strongly polynomial running time. Analysis is 
given in Section [5j Section [6] adapts the general algorithm for quadratic utilities, and for Fisher's 
market with linear and with spending constraint utilities. Section [7] contains a final discussion of 
the results and some open questions. 

2 Problem definitions 

Let G = (y,E) be a directed graph possibly containing parallel arcs. Let n = \V\, m = \E\. 
We are given lower and upper capacities u :£'—)■ M U {oo} on the arcs, and node demands 
6 : y — )• M with X^jgy bi = 0. On each arc ij G Cij : (— oo, oo) — )• M U {oo} is a convex function. 
Either it is differentiable everywhere, or it is oo on {—oo, iij) and is differentiable on (£jj,oo). The 
minimum-cost flow problem with separable convex objective is defined as follows. 

min ^ Cij{xij) 
ijeE 

j:ji€E j:ij€E 

(^ij < fij < Uij yij G E 

Throughout the paper, we shall work with the uncapacitated version, that is, ^ = and u = oo. 
With a standard method, every problem can be transformed to an equivalent uncapacitated form. 
Indeed, let us replace each arc ij £ E hy a path ikj of length 3, by introducing a new node k. 
Let us set bk = Uij — iij, Cik{a) = Cij{a + Ckj = 0. Furthermore, let us increase bi by iij 
and decrease bj by Uij. It is easy to see that this gives an equivalent optimization problem, and if 
the original graph had n' nodes and m' arcs, the transformed instance has n = n' + ml nodes and 
m = 2m' arcs. 

By a pseudoflow we mean a function f : E ^ M satisfying the capacity constraints. For the 
uncapacitated problem, it simply means / > 0. Let 

j:ji€E j-ijeE 

and let Ex{f) = Exb{f) = X^iey max{/jj(i) — 6j,0} denote the total positive excess. For an arc 
set F, let F denote the set of backward arcs and let We shall use the vector norms 

||x||oo = max \xi\ and = 
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Following [T7] and [21], we do not require the functions Cij to be given explicitly, but assume 
oracle access only. 

Assumption 1. For each arc ij, we are given a differential oracle that returns the value Clj{a) in 
0(1) time for any q € M. 

Note that we do not assume an evaluation oracle returning Cjj(a). The main algorithm needs 
only access to a differential oracle; however, the subroutines described in Assumptions H] and [3] may 
need stronger oracles. For quadratic objectives, the derivatives are linear functions, however, for the 
market applications, we will have logarithmic derivatives. As we shall argue in Section [6.21 in those 
settings, access is provided to the exact values e'^*J^°\ which is also sufficient for all applications of 
the oracle, most importantly, shortest path computations. Assumptions [H [3l and H] shall be given 
in Section HI 

Given a pseudoflow /, let us define the residual graph Ef by ij £ Ej if ij G E, or ji G E and 
fji > 0. Arcs of the first type are called forward, those of the latter type backward arcs. We use the 
convention that on a backward arc ji, fji = —fij, Cji{a) = Cij{—a), also convex and differentiable. 
The residual capacity is oo on forward arcs and fij on the backward arc ji. 

The Karush-Kuhn- Tucker conditions assert that the solution / to ([P]) is optimal if and only if 
there exists tt : V ^ M. such that 

TTj - TTi < C'ijif,,) yijeEf. (1) 

For a value A > 0, let Ef{A) denote the subset of arcs in Ef that have residual capacity at least 
A (in particular, it contains E). We say that the pseudoflow / is A-feasible, if Ef{A) contains no 
negative cycles with respect to the cost function C[j{fij + A). This is equivalent to the existence 
of a potential tt : ^ ^ M so that 

TTj - vTi < Clj {fij + A) ytj eEf{A). (2) 

If ji is a reverse arc, then this condition gives C[j{fij — A) < tTj — VTj. This is different (and 
weaker) than the condition in [25] and [IT], where {Cij {fij + A) — Cij {fij)) /A is used in the place 
oiCi^{fij + A). 

The following lemma shows that a 2A-feasible pseudoflow can be transformed to a A-feasible 
pseudoflow by small changes on the arcs. 

Lemma 2.1. // the pseudoflow f is 2A-feasible, then there exists a A-feasible pseudoflow f with 
fij — fij S {0, ±A} on each ij G E. Consequently, Ex{f) < Ex{f) + mA. 

Proof. Consider a potential vr satisfying ([2]) with / and 2A. We want to prove that modifying / 
on each arc by itA or will give a solution / satisfying ([2]) with A and the same vr. 

Let ij G E he a forward arc. If the condition is violated on ij, then C[j{fij + A) < Hj — vTj. 
However, by 2A-feasibility we know vTj — vTj < C[j{fij + 2A). These show that setting fij = fij + A 
satisfies ^ for both ij and ji, using that C'-j{fij - A) < C'-j{fij) = C[j{fij + A) < -Kj - vTj. 

In the sequel, assume that ([2]) holds for ij G E. If ji ^ Ej{A) or ([2]) holds for ji, then we set 
fij = fij. Assume now ji G Ej{A) but ([2|) is violated for ji, meaning tTj — TTi < C[j{fij — A). Let 
us set = - A. ij satisfies © by vr, - vr, < C(,.(/,, - A) < C[^{hj) = C[j{f^j + A). 

If ji G Ej{2A), then we have C^(/ij - A) = C^(/ij - 2 A) < vr, - vr^, and thus Q also holds 
for ji. Finally, if ji G Ej{A) - Ej{2A), then ji ^ Ef{A). □ 
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The subroutine Adjust(A) performs the simple steps of the proof. 

Finally, we may assume without loss of generality that G = {V, E) is strongly connected. Indeed, 
we can add a new node t with arcs vt, tv for any t> € with extremely high (possibly linear) cost 
functions on the arcs. This guarantees that an optimal solution shall not use such arcs, whenever 
the problem is feasible. We will also assume n < m. 

3 The basic algorithm 

Let us now outline the (weakly) polynomial algorithm by Minoux [25] . a simple extension of the 
Edmonds-Karp algorithm. 

We start with the Ag-feasible solution = with Ex{f^) < (2n + m)Ao. The algorithm 
consists of A-phases, starting with A = Aq, with A decreasing by exactly a factor of two between 
two phases. 

Each A-phase consists of a preprocessing part and a main part. In the first, Ao-phase, no 
preprocessing is needed. In each later phase, we start with a 2A-feasible pseudofiow /'. We 
perform Adjust(A) to obtain a A-feasible pseudofiow /. 

In the main part of phase A, let ^(A) = {i ^ V : Pf{i) — h > A} and T(A) = {i & V : 
Pf{i) - hi < -A}. As long as S'(A) ^ 0, r(A) ^ 0, send A units of fiow from a node s G S'(A) 
to a node t G r(A) on a shortest path in {/S) with respect to the cost function C-j{fij + A). If 
S'(A) = or T(A) = 0, we proceed to the next phase with scaling factor A/2. 

3.1 Analysis 

We omit the proof of the following two simple lemmas; their analogous counterparts for the enhanced 
algorithm will be proved in Section [5l 

Lemma 3.1. (i) In the A-phase, the pseudofiow is an integer multiple of A on each arc, and 
consequently, EJ{A) = EJ . 

(a) A-feasibility is maintained when augmenting on a shortest path. 

(Hi) At the beginning of the main part, Ex{f) < (2n + m)A, and at the end, Ex{f) < nA. 
(iv) The main part consists of at most 2n + m flow augmentation steps. 

Lemma 3.2. Let f he the pseudofiow at the end of the A-phase and f in an arbitrary later phase. 
Then \ \f — f'\\oo ^ (2f^ + m + 1)A. // fij > (2n + m + 1)A at the end of the A-phase, then this 
property is maintained in all later phases, and there exists an optimal solution f* with f*- > 0. 

4 The enhanced algorithm 

4.1 Revealed arc sets 

We investigate a slightly restricted class of functions Cjj . 

Assumption 2. Each cost function Cij{a) is either linear or strictly convex, that is, Clj{a) is 
either constant or strictly monotone increasing. 
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Arcs with Cij{a) linear are called linear arcs, the rest is called nonlinear arcs. Let mi and mjv 
denote their numbers, respectively. We use the terms linear and nonlinear for the reverse arcs as 
well. 

Let F* denote the set of arcs that are tight in every optimal solution. Formally, ij € F* if for 
all pairs (/, vr) satisfying ([1]), TTj — vTj = C[j{fij) holds. The next lemma shows that F* contains the 
support of every optimal solution. 

Lemma 4.1. Let f be an arbitrary optimal solution to and fij > for some ij S E. Then 
ij € F*. 

The proof needs the following notion, also used later. Let x,y : E Mhe two vectors. Let us 
define the difference graph Dx^y = {V,Ex^y) with ij G E^^y if Xij > yij or if Xji < yji. 



of Lemma \4-l\ Let /* be a different optimal solution, and consider potentials vr and vr* with both 
(/, vr) and (/*,vr*) satisfying ([T]). If /,*■ > 0, then ji € Ef* and thus vrj — vr* = C-j{f*j) must hold. 

Assume now f*j = 0. Consider the difference graph Djj*. Since fij > f*p it follows that 
ij G Efj*. Because of pj* = pf, Ejj* must contain a cycle C containing ij. For every arc ab G C, 
fab > fab ^^'^ thus ab G Ef* and ba G Ef. By 

0=Y,<-<< Y.^'Mb) and 

ab€C abi^C 

= - VTfe < CUfba) = -Yl C'abifab). 

ab€C abeC abeC 

Since each function Cab is convex, fab > fab gives C'^^{fab) > ^abifab)- above inequalities, 

equality must hold everywhere, implying vr* — vr* = C^j{f*j) as desired. □ 

We shall see that under Assumption [3l finding the set F* enables us to compute an optimal 
solution. In the basic algorithm, F = {ij G E : fij > (2n + m + 1)A} is always a subset of F* by 
Lemma |3.2[ Furthermore, once an arc enters F, it stays there in all later phases. Yet there is no 
guarantee (and it is in fact not true) that in the basic algorithm, F is extended in some number of 
steps polynomially bounded in n and m. We shall modify the algorithm in order to guarantee that 
within O(logn) phases, a new arc is guaranteed to enter F. 

In each step of the enhanced algorithm, there will be an arc set F, called the revealed arc set, 
which is guaranteed to be a subset of F* . We remove the lower capacity from arcs in F and allow 
also negative values here. 

Formally, a vector / : — > R is an F-pseudoHow, if fij > for ij G E — F (but it is allowed to 

be negative on F). For such an /, let us define EJ = Ef U If ij G F, then the residual capacity 
of ji is oo. We shall maintain an F-pseudofiow when F (1 F* is the set of revealed arcs in some 
phase of the algorithm. 

We say that / : — )• M is F -optimal, if it is an F-pseudoflow with Exb{f) = and there exists 
vr : y M with 

vr. -TT, <q^.(/i,) ^ij^Ej. (3) 

The definition of F* implies that any optimal /* is also F-optimal \i F Q F* . We shall prove that 
given an F-optimal solution, we can easily find an optimal solution as well. 

Lemma 4.2. Assume that for a subset F F* , an F -optimal solution f is provided. Then an 
optimal solution to 0j can be found by a maximum flow computation. Further, ij G F* whenever 
fij > 0. 
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Proof. Assume (/, vr) and (/,vf) both satisfy ([3]). We prove that (i) fij = fij whenever ij is a 
nonhnear arc; and (ii) ij is a hnear arc with fij ^ fij , then vTj — vTj = C'^j {fij ) = C'-j {fij ) = 7f j — Tfj . 

(^nj immediately imphes the second half of the claim as it can be applied for an arbitrary optimal 
/*. The proof uses the same argument as for Lemma l4.ll 

W.l.o.g. assume fij > fij for an arc ij, and consider the difference graph Dj-j. Since fij > fij, 
ij must be contained on a cycle C Ej j. For every arc ab ^ C, ab G EJ and ba € EJ follows 

(using f C EfnEf). By ©, 



= ^ n - VTa < ^ C'abUab) and 

ab^C abeC 



abeC afeGC afegC 

Now convexity yields C'^^{fab) = C'^^{fab) for all a6 G C. Assumption [2] implies that all arcs in 
C are linear, in particular, ij. This immediately proofs (i). To verify (ii), observe that all above 
inequalities must hold with equality. 

This suggests the following simple method to transform an F-optimal solution / to an optimal 
/*. For every nonlinear arc ij, we must have f*j = fij. On the set of linear arcs satisfying 
vTj — vTj = C'ij{fij), we can solve a feasible circulation problem with node demands being the same 
as the Pf{i) values, lower capacities and upper capacities oo. The feasible solutions are precisely 
the optimal solutions. Indeed, if /* is an optimal solution, then ([3]) and (ii) imply VTj — VTj = C'ij{fij) 
for all ij with /*• > 0. □ 

Let F be the set of revealed arcs in the A-phase. We maintain an F-pseudoflow /, but instead 
of A-feasibility, we require the following slightly stronger property. Let Ej{lS.) denote the set of 
arcs in E^ with residual capacity at least A. We say that the F-pseudoflow F is {A, F)- feasible, 
if Ej{A) contains no negative cycle with respect to the cost function C[j{fij + A). Equivalently, 
there exists a potential vr : y — >■ M so that 

TT, - vTi < C[j {f,j + A) yijGEf{A). (4) 

We shall also maintain that the set of revealed arcs, F is linear acylic, meaning that F does not 
contain any cycle of linear arcs. 



4.2 Subroutine assumptions 

Given the set F C F* of revealed arcs, we will try to find out whether F already contains the 
support of an optimal solution. This motivates the following definition. We say that the (not 
necessarily nonnegative) vector x : E M is F-tight, if Xij = whenever ij ^ F and there exists 
a potential vr : 1/ — ?> M with 

TTj -TTi = C'ij{xij) yij G F. (5) 

For example, any optimal solution is -F*-tight by Lemma l4.ll Notice that an i^-tight vector / is 
not necessarily F-optimal as ^ might be violated for arcs in E — F and also since Exb{f) > is 
allowed. On the other hand, an F-optimal set is not necessarily F-tight as it can be nonzero on 
E-F. 

Given F and some node demands b : V ^ M., we would like to find an F-tight x with Ex^{x) = 0. 
Let us define the discrepancy D^{F) of F as the maximum of | X^ig/f o^^'^ undirected connected 
components K of F. A trivial necessary condition for solvability is D^{F) = 0. 
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Assumption 3. Assume we have a subroutine Trial (F, b) so that for any linear acyclic F E 
and any vector b : V ^ M satisfying Dy{F) = 0, it delivers an F-tight solution x with Ex^{x) = 
in running time pT{n,m). 

For quadratic cost functions and also for Fisher markets, this subroutine can be implemented 
by solving simple systems of equations. 
Consider now an F-tight /, and let 

errpif) = inf{A : / is (A, F)-feasible}. 

Because of continuity, this infimum is actually a minimum whenever the set is nonempty. If / is 
not (A, F)-feasible for any A, then let erri?(/) = oo. / is F-optimal if and only if / is a feasible 
flow (that is, Exbif) = 0) and errpif) =0. As / is assumed to be F-tight, FJ (A) = F U ^, the 
same arc set in ^ for any value of A. 

Assumption 4. Assume a subroutine Error('/, FJ is provided, that returns errpif) for any F- 
tight pseudoflow f in running time pE{n^m). Further, if err ^^{Q) = oo, then 0j is unbounded. 

This subroutine seems significantly harder to implement for the applications: we need to solve 
a minimum cost-to-time ratio cycle problem for quadratic costs and all pairs shortest paths for the 
Fisher markets. 

4.3 Description of the enhanced algorithm 

The algorithm starts with the (Aq, 0)-feasible solution with Fx(0) < (2n + mAr)Ao. The ap- 
propriate value can be chosen as Aq = max{err0(O),F2;b(O)/(2n + uin)}. By the second part of 
Assumption HI err0(O) must be finite or the problem is unbounded. We initialize the revealed arc 
set F = 0. 

The algorithm consists of A-phases, starting with A = Aq. In the A-phase, we shall maintain a 
linear acyclic revealed arc set F C F*, and a (A, F)-feasible F-pseudoflow /. Besides preprocessing 
and main part, the phases have a third part, Trial-and-Error, where the next value of A is also 
determined. 

No preprocessing is done in the Ao-phase. In a later phase, either we have a A-feasible pseud- 
oflow from the previous phase, or the scaling factor in the previous phase was 2A, and thus we 
have a 2A-feasible /. In the latter case, Adjust(A) provides a A-feasible pseudoflow. 

The main part of the A-phase is the same as in the basic algorithm. Let S{A) = {i V : 
Pf{i) - fei > A} and r(A) = {ieV : pf{i) - h < -A}. As long as S{A) / 0, r(A) ^ 0, send A 
units of flow from a node s G 'S'(A) to a node t G F(A) on a shortest path in EJ{A) with respect 
to the cost function C[j{fij + A). 

After the main part is finished, Extend(F, /, A) adds some arcs ij G E — F with fij > 
(2n + m-|- 1)A to F. We add all nonlinear such ij's to F, and keep adding linear arcs as long as the 
linear acyclic property is maintained. Consider a linear arc ij, which is not admitted to F because 
there exists a path P C ^ between i and j. Then we reroute the entire amount fij of flow from 
ij to P. 

If no new arc enters F, then we move to the next scaling phase with the same / and set the 
scaling factor to A/2. This is done also if F is extended, but it still has a high discrepancy: 
Db{F) > A. 

Otherwise, the subroutine Trial- and-Error(F, /, A) determines the next / and A. Based on 
the arc set F, we find a new F-pseudofiow / and scaling factor at most A/2. The subroutine may 
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also terminate with an F-optimal solution, which enables us to find an optimal solution to (jP]) by 
a maximum flow computation due to Lemma 14.21 



Algorithm ENHANCED Convex Flow 

/ ^ 0; A ^ max{errg,{f), Exb{f)/{2n + mAr)}; 

repeat //A-phase 

if / is not A-feasible then Adjust(A); 
do //main part 

S{A)^{ieV : pf{i)-bi>A}; 
T(A) ^{ieV: pf{i) -h< -A}; 

P ^ shortest s - t path in Ef with s G 5(A), t £ T(A); 

send A units on P from s to t; 
while 5(A), r(A) / 0; 
Extend(F,/, A); 

if F was extended and {Db{F) < A) then Trial-and-Error(F, /, A) 
else A ^ A/2; 



The Trial-and-Error subroutine 

The subroutine assumes that the discrepancy of F is small: Dij(F) < A. 

Step 1. First, modify b to b: in each (undirected) component K of F, pick a node j & K and 
change bj by — X^igx^*! leave all other bi values unchanged. Thus we get a b with D~^{F) = 0. 
Trial (F, b) returns an F-tight vector /. 

Step 2. Call the subroutine Error(/,F). If 6 = 6 and errpif) = 0, then / is F-optimal. By 
Lemma 14.21 optimal solution to ([P|) can be found by a single maximum flow computation. In 
this case, the algorithm terminates. If errp^f) > A/2, then keep the original /, and go to the next 
scaling phase with scaling factor A/2. Otherwise, set f = f and define the next scaling factor as 

Anext = max{errF{f),Exb{f)/{2n + tjin)}- 

5 Analysis 

Trial-and-Error replaces / by / if errF{f) < A/2 and keeps the same / otherwise. The first 
case will be called a successful trial, the latter is unsuccessful. The following is (an almost identical) 
counterpart of Lemma 13.11 

Lemma 5.1. (i) In the A-phase, the F-pseudoflow f is an integer multiple of A on each arc 
ij G E — F, and consequently, EJ {A) = EJ . 

(a) (A, F)- feasibility is maintained in the main part and in subroutine Extend(F, /, A). 

(Hi) At the beginning of the main part, Ex{f) < (2n + mjv)A, and at the end, Ex{f) < nA. 

(iv) The main part consists of at most 2n + m-Tv flow augmentation steps. 
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Proof. For (i), f is zero everywhere in — at the beginning of the algorithm and after every 
successful trial. In every other case, the previous phase had scaling factor 2A, and thus by induction, 
the flow is an integer multiple of 2A at the beginning of the A-phase. This is maintained in the 
preprocessing, as Adjust may only modify by A. The shortest path augmentations also change 
the flow by A. This imphes Ef (A) = Ef . 

For (ii), P is a shortest path if there exists potentials vr verifying (j4]) with ttj — tTj = C'^j {fij + A) 
on each arc ij € P. We show that when augmenting on the shortest path P, is maintained 
with the same vr. If neither of is in P, then it is trivial as the flow is not changed on ij. 
If ij G P, then the new flow value will be fij + A, hence we need nj — vTj < C[j{fij + 2A), 
obvious as C[j is monotonely increasing. Finally, if ji € P, then the new flow is fij — A, and thus 
we need tTj — tTj < C[j{fij). By ji G P we had vTj — tTj = Cj^{fji + A), which is equivalent to 
TTj — TTj = Clj{fij — A), implying again the claim. 

In subroutine Extend, we reroute the flow fij from a linear arc ij if ^ contains a directed 
path P from i to j. This cannot affect feasibility since the C'.'s are constant on linear arcs. Also 

note .hat arcs have i„«„i.e..«idual capacities. 

For (Hi), Ex{f ) < nA as the main part terminates with either 5'(A) = or T(A) = 0. If the 
original flow / is kept and the next scaling factor is A/2, then Ex{f) < 2n(A/2) at the beginning 
of the next phase. Adjust(A/2) increases the excess by at most A/2 on each nonlinear arc, and it 
does not change values on linear arcs. If a successful trial replaced A by Anext, then by definition, 
the new excess is at most (2n + ?n,Ar)A„ea't, and ADJUST(A„ea;t) does not change anything as the 
flow is already A^ext-feasible. (Hi) immediately implies (iv), as each flow augmentation decreases 
Ex{f) by A. □ 

Lemma 5.2. F Q F* holds in each step of the algorithm. 

Proof. The proof is by induction. A new arc ij may enter F if fij > (2n + m + 1) A for a (A, F)- 
feasible /. We shall prove that fij > for some P-optimal solution, and thus Lemma 14.21 gives 
ij G F*. 

After the phase when ij entered, let us continue with running the basic algorithm in all later 
phases: we do not extend F and do not perform Trial- and-Error in any of the later phases, and 
always choose the next scaling factor as A/2. In a A'-phase, the flow is modified by at most A' on 
ij during preprocessing and (2n + mAr)A' during the main part by Lemma fS.lT v). Consequently, in 
all phases after A, the total modification is bounded by (2n+m+l)(A/2+A/4-|-...) < (2n+m-|-l). 

If we leave the algorithm running forever, it converges to the P-optimal solution /*. By the 
above observation, f*j > 0. □ 

The next lemma is of key importance. 
Lemma 5.3. In Trial-and-Error(F, /, A), errp{f) < 2(2n + m + 4)mA. 

Before proving the lemma, we show how it provides the strongly polynomial bound. 

Theorem 5.4. The enhanced algorithm terminates in at most 0((n + mAr) log m) scaling phases. 

Proof. The set of revealed arcs can be extended at most rriN + n — 1 times, since there can be 
at most (n — 1) linear arcs because of the linear acyclic property. We shall show that after any 
A-phase, a new arc is revealed within 2[log2 T~\ phases, for T = 8(2n + m + 4)m. As A decreases 
by at least a factor of two between two phases, after [log2T] steps we have At < A/T. Assume 
that in the At phase, we still have the same revealed arc set F as in the A-phase. 
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Assume first Di,{F) > A. At the end of the main part of the A^-phase, Dh{F) > (2n + m + 
2)mAx. Thus there is a connected component K of F with | X^ieA' ^ (^'^ + m + 2)mAT. We 
have 



\PfiK)\ 



> 



Exbif) > (2n + m + 2)m/S.T - uAt > (2n + m + IjmA^. 



There must be an arc ij entering or leaving K with fij > 2n + m + 1, a contradiction as at least 
one such arc must be added to F in Extend (F, /, At)- 

Assume next Di,{F) < A. We may assume that either we are at the very beginning of the 
algorithm with i*" = 0, or in a phase when F just has been extended; otherwise, we could consider 
an earlier phase with this property. We can interpret the initial solution and Aq as the output 
of Trial- and-Error(O). 

If Di){F) > At, the above argument shows that within the next |'log2T] steps, F shall be 
extended. Otherwise, we can apply the analysis of the Trial-and-Error subroutine for the 
Ay-phase. (Even if the subroutine is not actually performed, its analysis is valid provided that 
A(F)<^At.) 

Let / be the arc set found by Trial (F, b). This is the same in the A and the A^-phase (we may 
assume that b is modified to b always the same way for the same F) . In the event of an unsuccessful 
trial in the A-phase, A/2 < errF{f). Using Lemma 15.31 for the Ay-phase, 

errpU) < 2(2n + m + 4)mAT < A/4 < errF{f)/2, 

a contradiction. On the other hand, if we had a successful trial in the A-phase, then Ay < 
lAnextlT. Also, Ex^^f) < nDb{F) < nAy. Thus 

Anext = max{erry(/),£^Xfe(/)/(2n + ttin)} < 2(2n + m + 4)mAy < Anextf^, 

a contradiction again. □ 

Some preparation is needed to prove Lemma [ 



Lemma 5.5. For a linear acylic arc set F C E, letx andy be two F -tight vectors. Then H^— y||oo < 
\\Px - PyWi holds. 

Proof. First, we claim that the difference graph Dx^y = {V,Ex^y) is acyclic. Indeed, if there existed 
a cycle C C Ex,y, then we get = EafeeC C'afe(^ab) = T^abeC ^'abiVcib)- As Xab > yah for every 
ab G C, this is only possible if all arcs of C are linear, contradicting the linear acyclic property of 
F. (Note that E^^y C ^). 



Define the function g by gij = Xij — yij > for ij G E^^y (again with the convention Xji 



Xin 



Uji = ~yij if U & E). pg = px — Py, therefore we have to prove gij < \\pg\\i for ij € E^^y This 
property indeed holds for every positive g with acyclic support. 

Consider a reverse topological ordering wi, . . . oiV, where VpVq G E^^y implies p > q. For 
the arc ij G Ex^y, let i = Vt' and j = vt {t' > t). Let Vt = {vi, . . . , vt}. Vt is a directed cut in Ex^y, 
thus 

9vpV, =Ypg{Vp). 
p>t>q p<t 

As g is positive on all arcs, this implies gvav^ < 'l2p<t Pai^p) — \\Pg\h fo^ such arcs, in particular, 
for ij. □ 
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Claim 5.6. // / and f are F-pseudoflows with fij = for ij ^ E — F, and f is {A, F)- feasible, 
then f is A + [1/ — /I [oo feasible. 

Proof. There is a potential vr so that / and vr satisfy (jH, that is, vrj— vTj < C^^(/jj+A) if ij € EJ{A). 

For a = II/ — /Ilocwe have fij + A < fij + A + a. Consequently, ^ is satisfied for fij, vr and 

A + a for every arc in Ef{A). 

■' ^ ^ 

By the assumption that / is zero outside F, we have Ef{A + a) = E U F C Ef{A) and thus 

the claim follows. □ 

of Lemma \5.3l f is (A, i^)-feasible with some potential vr. We claim that there is an F-tight / so 
that \fij — fij\ < A for every ij £ F, and Exf,{f) < (2n + m + 2)mA. 
Indeed, (A, F)-feasibility gives 

Cljifij - A) < TTj - TTi < C[j{fij + A) Vii G F 

As C[j is continuous, there must be a value fij — A < (3 < fij + A with C[j{j3) = tTj — vTj. Let us 
set fij = j3. This increases Exh{f) by at most \F\A. 

Let us set fij = for ij £ E — F. Using that fij < {2n + m + 1)A if ij ^ F and Exi,{f) < nA, 
we obtain an F-tight / with 

Exbif) <nA + \F\A + (2n + m + l)(m - |F|)A 

< (2n + m + 2)mA. 

On the other hand, Exhif) ^ nDh{F) < nA. Consequently, 

||p/-p/||i < 11/0/ - bill + \\pf -b\\i = 2Exb{f) + 2Exb{f) < 2(2n + m + 3)mA. 

Applying Lemma 15.51 for x = f and y = f gives ||/ — /||oo ^ 2(2n + m + 3)mA. Now / is 
2(2n + m + 4)mA-feasible by Claim \5M □ 

Theorem 5.7. Zei ps{n,m) be the running time needed for one shortest path computation. Then 
the running time of the algorithm is bounded by 

0{{n + mN){pT{n,m) + pE{n,m)) + (n + niN)'^ ps{n,m.) \ogm). 

Proof. By Theorem 15.41 there are at most (n + m^r) log m scaling phases, each dominated by 
0{n + mjv) shortest path computations. The subroutine Trial-and-Error is performed only 
when F is extended, that is, at most n + mj^ times, and performs the subroutines Trial and 
Error. □ 

We may not use O(logn) = O(logm) as the graph is allowed to contain parallel arcs. 

Remark 5.8. A shortest path computation can be performed in time psin, m) = 0{m+n log n), see 
1^. Recall that the original problem instance was on n' nodes and m' arcs, and it was transformed 
to an uncapacitated instance on n = n' + m' nodes and m = 2m' arcs. However, as in Orlin's 128^ 
algorithm, we can use the bound 0{m' + n' log n') instead of 0{m' + m' log n') because shortest path 
computations can be essentially performed on the original network. 
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6 Applications 

6.1 Quadratic convex costs 

Assume that Cij{a) = CijO^ + dija for each ij G E, with Cij > 0. This clearly satisfies Assumption [1] 
since C'-{a) = 2cija + dij. Also, Assumption [2] is satisfied. 

The subroutine Trial(F, b) can be implemented by solving a system of linear equations. 

vTj - vTj = 2cijXij + dij Mij G F (6) 

Xji - ^ Xij =hi Vi G y 

To verify Assumption [3l we show that this system is solvable if F is linear acyclic and Db{F) = 0. 
Clearly, we can solve the system separately on different connected components of F. In the sequel, 
let us focus on a connected component K. 

Consider first the case when all arcs are linear. Then we can solve the equalities corresponding 
to arcs and nodes separately. As F is assumed to be linear acyclic, it forms a tree. If we fix one ttj 
value arbitrarily, it determines all other vTj values by moving along the arcs in the tree. The Xij^s 
can be found by solving a fiow problem on the same tree with the demands bi. Both tasks can be 
performed in linear time. 

Assume next both linear and nonlinear arcs are present, and let T be a connected component of 
linear arcs. As above, all ttj — vTj values for i,j G T are uniquely determined. If there is a nonlinear 
arc ij G F with i,j G T, then also uniquely determined. We can remove this arc by 

replacing hi hy bi + a and bj by bj — a. Hence we may assume that the components of linear arcs 
span no nonlinear arcs. 

Next, we can contract each such component T to a single node t by setting bt = YlieT ^* 
modifying the dij values on incident arcs appropriately. A solution to the contracted problem can 
be extended to the original instance. 

For the rest, we can assume all arcs are nonlinear, that is, Cij > for all ij G F. Let A be the 
node-arc incidence matrix of F on component K: Ai^ij = — 1, Aiji = 1 for all ij G F, and all other 
entries are 0. Let C be the \F\ x \F\ diagonal matrix with Cij^ij = —2cij. ([6]) can be written in the 
form 



This can be transformed into 




where L is the weighted |-fir| x \K\ Laplacian matrix with La = Xlj.jjg^ 2c~^ ~ -^i* ~ ~2c~ 
ij G F and Lij = otherwise, and b' is an appropriate vector with YlieK K ~ ^■ 

The main task is to solve the system Ln = b' . It is well- know that in the component K oi F, 
L has rank \K\ — 1 and the system is always solvable whenever Yli^K K ~ ^- ^ solution can be 
found in 0(n^ '^^) time [5]. All previously described operations can be done in 0{m) time, hence 
we obtain pE{n,m) = 0(n^'^^). 

To implement Error(/, F), we have to find the minimum A- value such that there exists a tt 
potential with 

TTj — VTj < (2cijXij + dij) + 2cijA yij £ E U F . (7) 

We show that this can be reduced to the minimum-cost-to-time ratio cycle problem (see [1, Chapter 
5.7]). In a directed graph, there is a cost function pij and a time Tij > associated with each 
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arc. The aim is to find a cycle C minimizing (X]ijg(7^'«i)/(Sij6C "^y)- ^ strongly polynomial 
algorithm was given by Megiddo |22tl23j that solves the problem in min{0(n^ log^ n), 0{n log n(n^ + 
m log log n))} time. The problem can be equivalently formulated as 

min fi s. t. there are no negative cycles 

for the cost function pij — fiTij . (8) 

Our problem fits into this framework with pij = 2cijXij + dij and Tjj = 2cij. In ([8]), the optimal 
fj, value is —A. However, [22] defines the minimum ratio cycle problem with Tij > for every 
ij G E. This property is not essential for Megiddo's algorithm, which uses a parametric search 
method for fj, to solve dH) under the only (implicit) restriction that the problem is feasible. 

In our setting > holds for nonlinear arcs, but Tij = for linear arcs. Also, there can be 
cycles C with Ylaec '^v ~ ^- (This can happen even if F is linear acyclic, as C can be any cycle in 

E U F .) If we have such a cycle C with YlijecPij ^ 0' then ([8]) is infeasible. In every other case, 
the problem is feasible and thus Megiddo's algorithm can be applied. 

For this reason, we first check whether there is a negative cycle with respect to the pij's in the 
set of linear arcs in E L) F . If there exists one, then ([7]) is infeasible, thus A = oo, and ([P]) is 
unbounded as we can send arbitrary flow around this cycle. Otherwise, we have Y^^^c Tij > for 
any cycle with Yliij&cPv ^ 0' ^'^^ consequently, there exists a finite A satisfying (j7]). 

Consequently, pT{n,m) = min{0(n^ log^ n), 0(nlogn(n^ + mloglogn))}. Theorem 15.71 gives 
the following running time bound. 

Theorem 6.1. For convex quadratic objectives on an uncapacitated instance on n nodes and m 
arcs, the algorithm finds an optimal solution in 0{m{n^ log^ n + mlogm{m + nlogn))). For a 
capacitated instance, the running time can be bounded by 0(m'* log m). 

The bottleneck is clearly the m minimum-cost-to-time computations. As in Remark 15. 8^ it is 
likely that one can get the same running time 0{m{n^ log^ n+m log m{m+n log n))) for capacitated 
instances as well analyzing Megiddo's algorithm. 

6.2 Fisher's market with hnear utihties 

In the linear Fisher market model, we are given a set B of buyers and a set G of goods. Buyer i 
has a budget mi, and there is one divisible unit of each good to be sold. For each buyer i £ B and 
good j € G, Uij > is the utility accrued by buyer i for one unit of good j. Let n = \B\ + \G\; let 
E be the set of pairs {i,j) with Uij > and let m = \E\. We assume that there is at least one such 
pair incident to each i and to each j. 

An equilibrium solution consist of prices pi on the goods and an allocation Xij, so that (i) all 
goods are sold, (ii) all money of the buyers is spent, and (Hi) each buyer i buys a best bundle of 
goods, that is, goods j maximizing Uij/pj. 

The classical convex programming relaxation of this problem was given by Eisenberg and Gale 
[7] in 1959. Recently, Shmyrev [31] gave the following alternative relaxation. The variable fij 
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represents the money payed by buyer i for product j. 

mm^Pj{logpj - 1) - ^ fijlogUij 

i&G ij€E 

^ fij =mi yi e B 
fi,>0 yijeE 

Let us construct a network on node set B U G L) {t} as follows. Add an arc ij for every i € B, 
j & G with Uij > 0, and an arc jt for every j G G. Set bi = —rrii for i £ B, bj = for j £ G and 
bt = YlieB'^^i- lower arc capacities be and upper arc capacities oo. With pj representing 

the flow on arc jt, the above formulation is a minimum-cost flow problem with separable convex 
objective. In this section, the convention pj = fjt shall be used for some pseudoflow / in the above 
problem. 

Let us justify that an optimal solution gives a market equilibrium. Let / be an optimal solution 
that satisfies ([1]) with vr : B Li G U {t} — )• M. We may assume vr^ = 0. Gj^{a) = log a implies 
TTj = — logpj. On each ij £ E we have iTj — iTi < — log Uij with equality if fij > 0. With /3j = e'^' , 
this is equivalent to Uij/pj < f^i, verifying that every buyer buys a best bundle of goods. 

Assumption [2] is clearly satisfied, however. Assumption [T] needs oracle access to the derivatives. 
On the arcs ij between buyers and goods, this is — log C/jj , while on an arc jt it is log fjt-, hence we 
cannot determine the exact values. 

The derivative oracle is used in two parts of the algorithm: for computing shortest paths between 
5(A) and T'(A) in the main part, and for the subroutine Adjust(A) in the preprocessing part. We 
can solve both these problems provided the rational values e'^'J*-"^. For shortest path computations, 
we use a multiplicative version of Dijkstra's algorithm, computing products instead of sums on the 
arcs. For Adjust(A), we need a potential vr verifying that / is (A, F)-feasible. Instead of the vTj 
values we can use the e"^ values, that can be chosen as rational numbers. 

Let us turn to Assumption [3l When the subroutine Trial is called, we transform 5 to 6 by 
changing the value at one node of each component K of F. For simplicity, let us always modify bt 
lit £ K. We shall verify Assumption [3] only for such 6's; the argument can easily be extended to 
arbitrary b (although it is not necessary for the algorithm). Let us call the component K containing 
t the large component. 

In TRiAL(i^), we want to find a potential tt : i? U G U {t} ^ M U {oo}, money allocations fij for 
ij £ F, i G B, j £ G, and prices pj = fjt for jt € F such that 

TTj - TTi = - log Uij Vij e F,i e B,j e G 

TTt-TTj =logPj yjt e F 

P3= E ^^^^^ 

i£B,ij£F 

We may again assume vrt = 0. Let Pj = e~'^^ for j & G and Ri = e'^'- for i £ B. With this notation, 
Uij /Pj = Ri for ij e F. If jt € F, then Pj = pj. 

Finding / and vr can be done independently on the different components of F. For any compo- 
nent different from the large one, all arcs are linear. Therefore we only need to find a feasible flow 
on a tree, and independently, Pj and Ri values satisfying Uij/Pj = Ri on arcs ij in the component. 
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Both of these can be performed in hnear time in the number of arcs in the tree. Note that mul- 
tiplying each Pj by a constant q > and dividing each i?j by the same a yields another feasible 
solution. 

Let Ti, . . . , Tfc be the components of the large component after deleting t. If Ti contains a single 
good j, then we shall set pj = Pj = {ttj = 0). If is nonsingular, then F restricted to Ti forms 
a spanning tree. The equalities Uij/Pj = Ri uniquely define the ratio Pj/Pj' for any j,j' G GflT^. 
Using that pj = Pj and Yli&BnTi^i ~ Yliji^Gr\TiPo-> ^^^^ uniquely determines the prices in the 
component. Then the arcs in F simply provide the allocations fij. All these computations can be 
performed in pT{n,m) = 0{m) time. 

For Assumption^ we show that Error(/, F) can be implemented based on the Floyd- Warshall 
algorithm. Let vr be the potential witnessing that / is (A, F)-feasible. Assuming tt^ = 0, and using 
again the notation Pj = e~'^^ for j £ G and Ri = e"* for i G we get Uij/Pj <RiifiGB,j€G, 
ij G E, with equality if ji G EJ . Furthermore, we have pj — A < Pj < pj + A if pj > and Pj < A 
if Pj = 0. 

Let us now define /3 : G x G — )• M as 



Pjj' = max <j jj^ : i G B,ji, if G EJ 



h] 

If no such i exists, define Pjji = 0; let fijj = 1 for every j G G. 

Clearly, Pji > Pjfijj' must hold to guarantee that arcs ij with ji G EJ have the best bang-per- 
buck values. Conversely, if the prices Pj satisfy this property, then we can set the Ri values as best 
bang-per-buck ratios, thus satisfying the necessary inequalities on all arcs. 

If there is a directed cycle G with Habi^cPab > 1, then / cannot be (A, F)-feasible for any A. 
Otherwise, we may compute f3jji as the maximum of Ii-abep(^ab over all directed paths P from j to j' 
(setting the value again if no such path exists). This can be done by the multiplicative version of 
the Floyd- Warshall algorithm in 0{n^) time (note that this is equivalent to finding all-pair shortest 
paths for — log (3ab)- 

For (A, F)-feasibility, we clearly need to satisfy 

(Pj - < PjPn' < Pf < Pj' + ^- 

Let us define A as the smallest value satisfying all these inequalities, that is, 

A = max I 0, max P^^J^'-P^' I . (9) 
I ^'^"^^ l^n' + 1 J 

We claim that / is (A, F)-feasible with the above choice. For each j G G, let Pj = max/jgc (3hj{ph — 
A). It is easy to verify that these P values satisfy Pj' > PjPjj', and pj — A < Pj < pj + A. 

The complexity of Error(/, F) is dominated by the Floyd- Warshall algorithm, 0{n^) [8j. The 
problem is defined on an uncapacitated network, with the number of nonlinear arcs mj\[ = \G\ < n. 
Thus Theorem 15.71 gives the following. 

Theorem 6.2. For Fisher's market with linear utilities, the algorithm finds an optimal solution in 
0(n^ -|- n^{m + nlogn) logn). 

The algorithm by Orlin |29) runs in 0(n^ logn) time, assuming m = 0{n'^). Under this assump- 
tion, we get the same running time bound. 
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6.3 Fisher's market with spending constraint utihties 



The spending constraint utility extension of linear Fisher markets was defined by Vazirani 
In this model, the utility of a buyer decreases as the function of the money spent on the good. 

1 9 i - - 

Formally, for each pair i and j there is a sequence Ufj > Ufj > . . . > U,-j^ > of utilities with 

numbers L^j, . . . , L-j > 0. Buyer i accrues utility Ulj for every unit of j he purchased by spending 
the first Lj- dollars on good j, Uf, for spending the next L?- dollars, etc. These lij intervals 

''J ''J *J •' 

corresponding to the pair ij are called segments, iij = is allowed, but altogether at least one 
segment is required to be incident to each good i and to each buyer j. Let n = + |G| denote the 
total number of buyers and goods, and m denote the total number of segments. Note that m > n? 
is also possible. 

No extension of the Eisenberg-Gale convex program is known to capture this problem. The 
existence of a convex programming formulation is left as an open question in [35] . This was settled 
by Devanur et al. [2], showing that Shmyrev's formulation naturally extends here. Let fj'j represent 
the money payed by buyer i for the /c'th segment of product j, 1 < k < £ij. 



min pj (log Pj-l)- Yl 4 ^ii 



1-3 

ieG i€B,j€G,l<k<eij 



^<ff,<L% \fijeE. 

This gives a convex cost flow problem again on the node set BUGU {t}, by adding £ij parallel 
arcs from i £ B to j G, and arcs jt for each j G G. The upper capacity on the fe'th segment 
for the pair ij is Lfj. To apply our method, we first need to transform it to an equivalent problem 
without upper capacities. This is done by replacing the arc representing the /c'th segment of ij by a 
new node {ij, k) and two arcs i{ij, k) and j{ij, k). The node demand on the new node is set to L^p 
while on the good j, we replace the demand by — ^ L^^-, the negative of the sum of capacities 
of all incident segments. The cost function on i{ij,k) is — logU^ja, while the cost of j{ij,k) is 0. 
Let 5 denote the set of the new {ij, k) nodes. This modified graph has n' = n + m + 1 nodes and 
m' = 2m + |G| arcs. 

Assumptions [1] and [2] are satisfied the same way as for linear Fisher markets, using an oracle 
for the e'^'i^"^ values. 

In Trial (F), we want to find an F-tight fiow /' on the extended network, witnessed by the 
potential tt : B U S U G Li {t} ^ R. We may assume nt = 0. Let Pj = 6""^ for j G G and Ri = e'"' 
for i G S and Sfj = e-""^-^*) . For the fc'th segment of ij, Uj'j/Sfj < Ri and Sfj > Pj. If i{ij, k) e F 
then Uf'j/Pj > Ri and if k) e F then Uj'j/Pj < Ri . 

As for linear Fisher markets, if a component of F does not contain t, we can simply compute 
all potentials and flows as F is a spanning tree of linear arcs in this component. 

For the component K with t £ K, let Ti be a component oi K — t. F is a spanning tree of linear 
arcs in as well, therefore the ratio Pj/Pji is uniquely deflned for any j,j' € GnTg. On the other 
hand, we must have Pj = pj, and we know that YljeGnTiPj ~ ~YlveTi conservation. 
These determine the Pj = pj values, and thus all other Ri and Sfj values in the component as well. 
The support of the flow fij is a tree and hence it can also easily computed. The running time of 
Trial is again linear, pT{n',m') = 0{m') = 0{m). 
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Error(/, F) can be implemented the same way as for the Hnear Fisher market. We shall define 
the values /? : G x G — )• M so that Pj' > Pj/3jj' must hold, and conversely, given Pj prices satisfying 
these conditions, we can define the Ri and S^j values feasibly. Let 

Pjj' = ^^^{rjk '■ ^ ^ BJ{ij,k),{ij,k)i,i{ij',k'),{ij',k')j' G 

ij 

Given these values, the values can be computed by the Floyd- Warshall algorithm and the 
optimal A obtained by Q as for the linear case. 

Finding the Pjj' values can be done in 0{m') time, and the Floyd- Warshall algorithm runs in 
0(|Gp). This gives pE{n',m') = 0{m' + \G\^) = 0{m + n^). From Theorem 15.71 together with 
Remark 15.81 we obtain: 

Theorem 6.3. For an instance of Fisher's market with spending constraint utilities with n = 
\B\ + \G\ and m segments, the running time can be bounded by 0{mn?' + m'^{m + nlogn) logm). 

7 Discussion 

We have given strongly polynomial algorithms for a class of minimum-cost flow problems with 
separable convex objectives. This gives the first strongly polynomial algorithms for quadratic 
convex cost functions and for Fisher's market with spending constraint utilities. For Fisher's 
market with linear utilities, we get the same complexity as Orlin |29j . 

The bottleneck in complexity of all applications is the subroutine Trial. However, the exact 
value of errp{f) is not needed: a constant approximation would also yield the same complexity 
bounds. Unfortunately, no such algorithm is known for the minimum cost-to-time ratio cycle 
problem that would have significantly better, strongly polynomial running time. Finding such an 
algorithm would immediately improve the running time for quadratic costs. 

A natural future direction could be to develop strongly polynomial algorithms for quadratic 
objectives and constraint matrices with bounded subdeterminants. This would be a counterpart of 
Tardos' |34] for linear programs. Such an extension could be possible by extending our techniques 
to the setting of Hochbaum and Shantikumar . 

The recent paper [37] shows that linear Fisher market, along with several extension, can be 
captured by a concave extension of the generalized flow model. A natural question is if there is any 
direct connection between the concave generalized flow model and the convex minimum cost flow 
model studied in this paper. Despite certain similarities, no reduction is known in any direction. 
Indeed, no such reduction is known even between the linear special cases, that is, generalized flows 
and minimum-cost flows. The perfect price discrimination model by Goel and Vazirani [llj, and the 
Arrow-Debreu Nash-bargaining problem by Vazirani [36], are instances of the concave generalized 
flow model, but they are not known to be reducible to convex cost flows. On the other hand, the 
spending constraint utility model investigated in this paper is not known to be reducible to concave 
generalized flows. 

The algorithm in [37] is not strongly polynomial. Moreover, no strongly polynomial algorithm 
is known for linear generalized flows, despite the huge literature on polynomial time algorithms. 
Developing a strongly polynomial algorithm for generalized flows is a fundamental open question. 
Resolving it could lead to strongly polynomial algorithms for the market problems that fit into the 
concave generalized flow model. 

A related problem is finding a strongly polynomial algorithm for minimizing a separable convex 
objective over a submodular polyhedron. Fujishige ^lOj showed that for separable convex quadratic 
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costs, this is essentially equivalent to submodular function minimization. Submodular utility al- 
location markets by Jain and Vazirani {20] also fall into this class, and are solvable in strongly 
polynomial time; see also Nagano ^26j . Other strongly polynomially solvable special cases are given 
by Hochbaum and Hong |13| . 

A common generalization of this problem and ours is minimizing a separable convex objective 
over a submodular flow polyhedron. Weakly polynomial algorithms were given by Iwata [TS] and by 
Iwata, McCormick and Shigeno [19]. One might try to develop strongly polynomial algorithms for 
some class of separable convex objectives; in particular, for separable convex quadratic functions. 
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